Making a Cloud Provenance-Aware
Authors
Abstract
The advent of cloud computing provides a cheap and convenient mechanism for scientists to share data. Such data is more useful when its provenance is also available. The cloud, while convenient for storing data, is not designed for storing and querying provenance. In this paper, we present desirable properties for distributed provenance storage systems and describe design alternatives for storing data and provenance on Amazon’s popular Web Services platform (AWS). We evaluate the properties satisfied by each approach and analyze the cost of storing and querying provenance in each approach.
Similar Resources
The Application of Cloud Computing to the Creation of Image Mosaics and Management of Their Provenance
We have used the Montage image mosaic engine to investigate the cost and performance of processing images on the Amazon EC2 cloud, and to inform the requirements that higher-level products impose on provenance management technologies. We will present a detailed comparison of the performance of Montage on the cloud and on the Abe high performance cluster at the National Center for Supercomputing ...
Energy Aware Resource Management of Cloud Data Centers
Cloud Computing, the long-held dream of computing as a utility, has the potential to transform a large part of the IT industry, making software even more attractive as a service and shaping the way IT hardware is designed and purchased. Virtualization technology forms a key concept for new cloud computing architectures. The data centers are used to provide cloud services burdening a significant...
Collecting Provenance via the Xen Hypervisor
The Provenance Aware Storage Systems project (PASS) currently collects system-level provenance by intercepting system calls in the Linux kernel and storing the provenance in a stackable filesystem. While this approach is reasonably efficient, it suffers from two significant drawbacks: each new revision of the kernel requires reintegration of PASS changes, the stability of which must be continua...
Using Cloud-Aware Provenance to Reproduce Scientific Workflow Execution on Cloud
Provenance has been thought of as a mechanism to verify a workflow and to provide workflow reproducibility. This provenance of scientific workflows has been effectively carried out in Grid-based scientific workflow systems. However, the recent adoption of Cloud-based scientific workflows presents an opportunity to investigate the suitability of existing approaches or propose new approaches to collect p...
Data Provenance in Distributed Propagator Networks
Existing distributed programs often require provenance to be included in the design of the distributed computing framework. Distributed programs making use of data propagation do not have this restriction; propagator networks allow non-provenance-aware applications to be easily transformed into provenance-aware forms by simply modifying existing program structure.